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ABSTRACT 

In  this  report  we  detine  strong  contidence  interval 
procedures  and  discuss  their  properties.  Strong  conti¬ 
dence  means  that  the  reported  contidence  level  is 
achieved  even  conditioned  on  contigurations.  Furthermore 
this  is  true  tor  both  the  Gaussian  and  the  slash  sampling 
situations.  We  will  show  how  such  a  procedure  can  be 
obtained  and  compare  its  pertormance  to  some  popular 
non-parametr ic  contidence  intervals. 


1.  INTRODUCTION 

Robust  contidence  interval  estimation  is  not  a  heavy  researched 
area  ot  statistics.  Around  1945  the  tirst  papers  about  non- 
parametr  ic  methods  appeared.  These  procedures  provide  us  with 
contidence  intervals  which  reach  the  reported  contidence  coetticient 
under  any  —  or  any  symmetric  --  sampling  situation.  In  this  sense 
they  are  robust  ( robustness  ot  validity) . 

In  this  report  we  plan  to  emphasize  (some  more)  the  idea  ot 
having  a  highly  resistant  contidence  coetticient.  In  Section  2  we 

Prepared  in  connection  with  research  at  Princeton  University,  spon¬ 
sored  by  the  Army  Research  Ottice  (Durham).  The  computing  tacilities 
were  provided  by  the  Department  ot  Energy,  Contract  DE-AC02- 
81ER1034 1 . 
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will  introduce  the  formulas  and  methods  tor  a  single,  known  sampling 


situation.  Section 
are  concerned  about 
leads  to  the  choice 
Gaussian . 


3,  then  deals  with  the  case  ot  unknown  shape, 
the  dangers  ot  heavy-tailed  situations  which 
ot  the  slash-shape  as  our  counterpart  to  the 


We 


2.  An  equivar iant  cont idence  interval  procedure  tor  a  location 
parameter . 


A  lot  ot  work  has  been  done  in  the  area  ot  robust  inference 
about  location  and  scale  parameters.  Theoretical  methods  like 
asymptotic  minimax  and  influence  curves  handle  the  case  ot  known 
scale  in  a  convincing  way  and  the  theory  is  widely  accepted.  Hand  in 
hand  with  this  development  went  the  recognition  ot  robustness  as  a 
research  topic  in  applied  statistics.  We  therefore  hardly  have  to 
argue  in  favor  ot  robust  procedures. 

The  need  tor  more  work  connected  with  specific  sample  sizes  is  less 
widely  acknowledged.  The  Princeton  robustness  study  (Andrews  et 
al (1972) )  is  an  early  example  tor  this  kind  ot  research.  There 
several  location  estimators  are  tested  in  a  variety  ot  situations  and 
sample  sizes. 


In  the  following  sections  we  will  discuss 
combines  the  two  ideas  ot  robustness  and  small 
topic  is  confidence  interval  estimation ,  where 
1 imited . 


an  approach  which 
sample  study.  The 
our  knowledge  is  quite 


Let 
problem , 


us  start  our  discussion  with  the  classic 
where 


location  and 


scale 
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^  -  (yry2*y3r---»yn) 

is  a  vector  ot  iid  random  variables  drawn  trom  the  distribution 

F(X  *1  e  R,  o-  e  R+ 

which  is  fixed  and  known  except  for  the  location  p  and  the  scale  cr. 
Let  us  assume  that  the  vector  $  is  ordered,  i.e. 

Vi  <  y2  <  •••  <  yn  • 

From  the  beginning  we  restrict  attention  to  location  and  scale 
equivar iant  procedures .  By  this  we  mean  the  functional  property  ot 
the  procedure  T 

T(s*  +  rl* )  ■  sT(y^)  +  r 

whenever  s€R+  and  rGR,  i.e.  under  certain  canonical  transformations 
ot  the  sample  $ ,  the  value  ot  T  transforms  canonically  too  (I*  stands 
tor  the  vector  in  Rn  whose  components  are  all  1).  Such  functions 
are  best  studied  in  reference  to  location  and  scale  configurations , 
i.e.  the  equivalence  classes  ot  the  equivalence  relation  defined 
on  Rn 

i t  equivalent  to  $  iff  there  is  a  positive,  real  s  and  a  real 

t  such  that  $  *  s(t t  +  )  . 

Any  location  and  scale  equivariant  mapping 

T  Rn  -->  R 

will  be  completely  specified  in  any  configuration  by  fixing  its  value 
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T(?)  &  R  tor  a  class-representing  element  since  by  the  detining 
relation  tor  equivariance 

T(J^)  ■  T(s(t?  ♦  ^))  «  sT(tf  +  t)  *  s(t  +  T(c^)). 

When  doing  numerical  integrations  (with  respect  to  conditional 
distributions  given  the  contiguration)  it  is  advisable  to  choose  the 
class-representing  element  such  that 


cb  -  +1 

and  (since  we  have  ordered  trom  the  beginning)  c^  <  <  ...  <  cn»  A 

good  choice  seems  to  be  index  a  around  ^  and  index  b  around  ^ .  This 
means  that  we  choose  as  our  element  representing  the  contiguration, 
that  sample  which  satisfies  the  two  above  restrictions.  Ot  course 
this  element  will  be  uniquely  defined. 

Another  way  ot  thinking  about  the  class-representing  elements  is  to 
say  that  they  are  base  points  tor  parametrizing  configurations,  which 
are  two-dimensional  classes  ot  samples. 

Any  sample  /  *  (y^ ,y£» . . . ,yR)  determines  uniquely  its 
contiguration  (we  use  the  word  contiguration  also  to  denote  the 
class-representing  elements}  by 
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(tjSiCjiCjf##*  >  tGR,sGR+ 


Remember  that  (c^rc2,...,cn)  ■  i?  is  (n-2)-  dimensional  since  two 
values  are  pre-tixed. 

It  we  have  a  sample  y,  we  know  the  contiguration  c  and  it  can 
be  argued  that  this  contains  all  ot  the  "allowed"  information  —  we 
should  not  be  intluenced  by  the  actual  values  ot  sy  and  t^.  The 
"point  pattern”  ot  the  sample  by  itselt  should  tix  our  ideas  about 
location  and  scale.  J.W.  Tukey's  picture  tor  this  is  a  code  clerk  who 
pre-processes  our  sample  $  and  only  gives  to  us  the  contiguration  & 
onto  which  we  have  to  base  our  inference  ("inference  on  the 
^-scale").  After  handing  him  our  estimated  values  back,  the  code 
clerk  can  use  his  knowledge  ot  sy  and  ty  to  transform  the  answer  back 
to  the  yf-scale,  he  simply  has  to  add  ty  back  in  and  multiply  by  sy. 

The  configurations  partition  the  sample  space  Rn  into  2- 
dimensional  subsets  --  parametrized  by  ten  and  sSR+  --  which  are  the 
largest  sets  where  a  general  location  and  scale  equivariant  mapping 
behaves  perfectly  simply  and  can  be  defined  by  a  single  real  number. 

But  configurations  are  also  connected  with  the  probability 
structure  on  the  sample  space  induced  by  iid  sampling  from 
F(<~1)  y€R  ,  The  idea  works  because  of  the  equivariance 

properties  ot  the  probability  structure,  i.e.  the  canonical  changes 
which  take  place  inside  configurations  it  the  parameter  values  are 
changed . 

Let  kp(t,sly,0’,^)  denote  the  conditional  density  given  the 
contiguration  We  have  to  find  the  Jacobian  ot  the  transformation 
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y .  ■  s(t  +  Cj)  i  »  1 , . . . ,n. 

It  we  take  the  coordinates  ot  y  in  the  order  (y  .y,,y . , . . .  ,y  )  the 

a  D  1  n 

Jacobian  is  equal  to: 

s 

abs(det(  s 
s 

Hence  we  have 

kp(s,t|*i,<r,^)  dsdt  is  proportional  to 


t-1 

t+1  .  ))  -  as0'1. 


.n-1 


n 

n  C  t  ( 

i*l 


(s( t+c^) -p) 


ds  dt. 


i .e .proportional  to 


(I)"'1 


n 

n 

i-1 


U(|(^H  ♦ 


c^  )  } 


where  t  is  the  density  tunction  corresponding  to  F. 

So  kF(s,tlu,o-,^)  ds  dt  «  kp(p,q|0,l,*)  dp  dq  it  we  put 


P  " 


s 

a 


q  .  SllE  .  t  -  H 

^  s  s 

or 


(2.1) 


B  •  pc 
*  *  «  *  & 


We  learn  trom  this  that  the  optimal  choice  ot  location  and  scale 
equivariant  procedures  is  not  dependent  on  the  true  parameter  values 
p  and  9.  By  coding  the  data  in  terms  ot  a  contiguration,  we  have  got 
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rid  ot  the  "true"  parameter  values,  since  any  choice  we  make  on  tht 
^-level  has  to  be  used  tor  all  possible  "true"  parameter  values. 

Figures  2.1  through  2.4  show  a  series  ot  contourplots  in  the 
(t,s)-plane  ot  the  densities  k(s,t|0,l,^)  tor  two  contigurations  in 
twenty  dimensions.  Fig.  2.1  and  2.2  show  the  Gaussian  and  slash  (see 

3.1)  in  a  Gaussian-  drawn,  nicely  behaved  contiguration.  Fig.  2.3  and 
2.4  do  the  same  with  a  slash-drawn  contiguration.  The  contiguration 
is  included  in  these  pictures  with  "•"’s,  the  scale  on  the  t-axis  is, 
however,  not  relevant.  We  can  see  how  the  “outliers"  in  the  second 
case  intluence  the  Gaussian  density,  which  is  pushed  to  small  s- 
values  and  very  stretched  in  the  t-direction.  The  slash  picture  does 
not  exhibit  such  drastic  changes.  Ot  special  interest  is  the 
ditterence  between  the  two  densities  in  any  given,  tixed 
contiguration,  because  a  big  such  ditterence  tells  us,  that  the  two 
sampling  situations  "interpret"  the  data  quite  ditterently  and  lead 
us  to  quite  ditterent  conclusions.  The  contigurations  used  tor  these 
plots  are  : 

For  the  Gaussian-drawn  c • (-1 . 4,-1 . 3, -1 . 2, -1 . 1 ,-l . 0, -. 91 , 

-.76, -.63, -.30, -.14, -.03,. 23,. 43,. 64,. 89, 1.0, 1.5, 1.5, 1.6, 

3.2) 

For  the  slash-drawn  <?* (-13. 8, -6. 7,-1 . 5,-1 . 1 ,-l . 0,-. 90 , -. 65 , 

-. 51, -.39, -.09, -.01, .32, .70, .78, .91,1.0,1.2,1.3,2.8,9.5) . 
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Gauss i an-drawn 


slash-drawn  conf igurat ion 
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estimated  value  on  the  ? -scale  —  such  that 


ave[((T(*)  +  t)s  -  p)2lp,<r,*] 


avel(T<£>  +  t)2s2-2(T(£)  +  t)  sp  +  u2l  p,cr,^] 


r<‘ 


is  minimal .  Hence 


ave[(T(cf)  +  t)  s2  *  ave[sylu,cr,^) 


T,  (^) 

*J  t<r 


or 


4  2  4 

U  ave[s|»i,<r,c  ]-ave[ts  |ptcr,c  } 


ave[s2  ] 


oo  oo 


S  f  s(*j-ts)  k_(s,t  |p,<r,^ )  dsdt 
0  -oo 

oo  oo  , 

r  r  s  k_(s,t|jj,<r,c  )  dsdt 
0  -oo 


oo  oo 


-S  /  P(qP)  kF(p,qlO,l,^)  dpdq 
6  -oo 
oo  oo 

J  /  P  ^c-tP^ql  0  f  1  ,c  )  dpdq 
0  -oo 


(see  1.1) 


-ave[qp2|0,l,^l 
ave[p2|0 , 1 1 


(2.2) 


This  invariance  result  is  intuitively  obvious:  it  a  code  clerk 
gives  the  configuration  &  to  the  statistician  and  asks  tor  a  "good" 
location  estimate,  he  has  to  -.rome  up  with  the  same  answer  no  matter 
what  the  true  parameter  values  n  and  a. 

For  further  discussion  ot  this  problem  see:  Pitman(1938) , 
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X . _ _ _ .  % 


e 


will  always  calculate  our  integrals  with  a  standard  torm 
p  «  0  ,  cr  ■  1  and  otherwise  not  mention  them. 

The  observation  that  the  contiguration  c  contains  all  ot  the 

allowed  information  is  formalized  by  the  statistical  notion  ot 

ancillarity.  The  distribution  across  the  2-dimensional 

configurations,  i.e.  the  frequencies  ot  tailing  into  configurations, 

only  depends  on  the  shape  F(  )  but  not  on  the  actual  values  ot 

p  and  a.  This  is  true  since  the  step  from  the  sample  £  to  the 

contiguration  £  involves  the  subtraction  ot  a  "location"  estimate 

and  division  by  a  "scale"  estimate,  so  that  the  distribution  ot  £  no 

longer  involves  the  parameters  p  and  a  (and  hence  is  pure  shape). 

This  tact  is  used  as  an  argument  to  condition  on  the  ancillary 

statistic  £  —  the  conditional  distribution  contains  the 

"information"  about  location  and  scale  (see:  Fisher (1934) )  . 

These  are  the  ideas  we  want  to  use.  They  greatly  simplify  all 

inference  problems  about  p  and  <y.  Firstly  all  our  mappings  will  be 

uniquely  defined  on  each  contiguration  £  by  a  single  real  number  tc. 

But  secondly  the  choice  ot  tc  is  often  possible  without  reference  to 

other  configurations  and  we  hence  deal  with  a  "simple"  2-dimensional 
* 

problem  no  matter  how  big  the  sample  size. 

2.1.  The  single  situation  confidence  distribution. 

Let  us  fix  a  contiguration  £  and  a  shape  F  (  )  in  standard 
torm,  i.e.  p  ■  0.  Conditioned  on  £  we  deal  with  the  2-dimensional 
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1 

«  (n-3) (n-5) ...1(|)*  it  n  even. 

Now  we  want  to  study  the  ettects  ot  setting  an  upper  bound  tor  the 
parameter  *1.  It  u  is  the  value  ot  the  upper  bound  statistic  U  on  the 
?-scale,  i.e.  U{?)  *  u ,  the  statistic  is  detined  on  the  whole 
contiguration  by 

U(? )  ■  U(s(?+t?))  ■  s(u+t) 

(location  and  scale  equivar iance) . 

We  are  interested  in  the  coverage  probability  detined  by 

CoF(u)  Pp[U(y)  >  01?) 
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It  follows 


Cop(u)  «  Pp[s(u>t)  >  01?)  «  Pp[t  >  -u|?) 


oo  oo 

CoF(u)  ■  f  /kp(s,t|?)  dsdt.  (2.4 

0  -u 

COp(u)  25  the  function  which  tor  a  cho ice  of  an  upper  bound  u  c>n  the 
?-scale  gives  as  its  value  the  conditional  probabi 1 i ty  of  the  upper 
bound  statistic  actually  being  an  upper  bound  tor  the  parameter  jj  on 
the  given  configuration. 

remark : 

Again  COp(u)  is  not  dependent  on  the  values  p  and  a  which  are  used  to 
calculate  the  integrals.  For  a  shape  F(  )  not  in  standard  form  we 
would  have 


CV<r,F(u) 


U.o-.F 


tU(y)  >  m] 


V<r,F(t 


E- 

s 


U] 


oo  oo 


(s,t|*i,<y, 


?) 


dsdt 


From  (2.4)  we  see  that 


COp(u)  -->  1  as  u  — >  oo 


(see  2.1) 


Cop(u)  — >  0  as  u  — >  -oo 
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which  is  consistent  with  our  intuition.  COp(u)  is  like  a  distribution 
function  and  can  be  used  to  define  confidence  intervals  tor  the 
location  parameter  p. 

Whenever  we  choose  L{c)  and  U(cf)  such  that 
COp(U(?))  -  COp(L(?))  ■  .95  we  have  a  2-dimensional  piece  of  a  95% 

confidence-interval  procedure  which  has  exactly  95%  coverage 
probability  conditioned  on  the  configuration.  The  upper  bound  of  the 
interval  is  below  p  with  probability  1  -  Co(U(?))  and  the  lower 
bound  is  above  with  probability  Co(L(?))  and  therefore  the  interval 
covers  the  parameter  with  conditional  probability 

1  -  (1  -  Co(U<?))  ♦  Co(L(?)))  -  Co(U(?))  -  Co  (L  (<f ) ) 

If  we  choose  to  do  so  tor  each  configuration,  we  will  have  a  95% 
confidence-interval  procedure,  since  we  get  the  "overall"  coverage 
probability  by  averaging  the  conditional  coverage  probabilities, 
which  in  the  above  case  are  constant,  across  configurations. 


The  function  Cop{  )  will  be  called  a  coverage  distribution  or  a 
confidence  distribution.  It  has  the  density 


F 


(u) 


COp(u) 


COp(u) 


TkF(s,-u|?)  ds. 
0 


(2.5) 


remark : 

How  does  the  mean  of  the  confidence  distribution  compare  with  the 
minimum-mean-square-error  location  estimate?  The  mean  m  is  equal  to 
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oo  oo 


m  *  f  uf  k«(s,-u|?)  ds  du 
-oo  0 


-  JT  J  t  kp(srtl?)  dsdt  «  -ave(tlcf). 


whereas  the  "optimal"  location  estimate  t  is 


t„  .  -  »ve(ts2l ?)  (,,,2.2). 


avel 


The  "optimal"  estimate  therefore  takes  the  correlation  between  s  and 
t  into  account. 


The  confidence  distribution  is  not  dependent  upon  the  values  of 
the  parameters  p  and  or  used  tor  the  calculation  as  we  have  already 
seen.  Now  we  want  to  explore  the  behavior  under  changes  of  the  class 
representing  element 

Let  us  first  look  at  the  case  where  «  v?,  i.e.  our  representing 
element  is  scaled  by  v€R+.  From  (2.3)  we  have 


_  ,  n 

sn  1  n  t(s(d.+t>) 

i-l  1 


oo  ,  n 

/  sn  1nt(s(d.+t) )  dsdt 
-oo  1 


kf(s,t 


$ ) 


oo 


n-1  n 

■ n  t(s(vc.+t) ) 

_ i^l _ _ _ 

oo  oo  .  n 

J  S  s  n  t(s(vc.  +  t) )  dsdt 
0  -oo  i-l  1 


n-l  n  f 

<vs)n  n  t(v s(c.+£)) 
i-l  1  v 


oo  oo  „  ,  n 

J  J*  (vs)n  1  II  t(vs(c,+i))  dsdt 
0  -oo  i-l  1  v 
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(vs) 


n-  x 


n  t ( v  s ( c . ) ) 

i«l 


n 


oo  .  n 

a  II  i(a(c.+b) )  dadb 

-oo  i=  ’ 


kF(vs,i|? ) 


We  used  the  change  ot  variables: 


a  *  vs  — >  da  *  vds  and  b  -  ^  — >  db  »  ^  -->dadb  -  dsdt 


The  contidence  density  on  the  ^-scale  is  theretore 


oo 


cop(ul^)  «  Jk_(s,-ul?)  ds 
F  ^  F 


oo 


£  kp(vs,-  -^Icf )  ds 


?kp(af-  $\£)  &  -  i  cor$\t). 


We  see  that  the  multiplier  v  behaves  like  a  scale  parameter! 

Now  let  us  consider  the  case  where  c?  •  i.e.  our  representing 
element  is  translated  in  the  (1,1,..., 1)  -  direction  by  w  e  I.  Froi 
(2.3)  we  have 


kp(s,t I  £ ) 


_  ,  n 

sn  A  n  t(s(d.t-t) ) 

_ i-1  1 _ 

oo  oo  .  n 

S  I  Sn~x  n  i(s(d.+t) )  dsdt 
0  -oo  i-1  1 


s"*1  II  ttstCj+w+t) ) 
i-1 


oo  oo  ,  n 

J  r  s  1  n  t(s(c4+w+t))  dsdt 
0  -oo  i-1  1 


sn_1  II  t(s(c^w+t) ) 

_ i-1  1 _ 

oo  oo  ,  n 

J  S  s0”1  n  t(s(c.+b))  dsdt 
0  -oo  i-1  1 


kp(s,t+w|  £ )  . 
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We  used  the  change  ot  variable:  b  ■  t+w  — >  db  *  dt. 
The  contidence  density  on  the  £ -scale  is  therefore 


oo 


oo 


cop(ul^)  «  T  kp( s ,-u|  cf ) ds  «  T  kp(s (u+w)  I  ?)  ds  *  cOp(u+w|<?). 


We  see  that  a  "translator"  w  behaves  like  a  location  parameter.  The 
behavior  under  changes  or  transformations  ot  the  class-representing 
elements  is  again  following  our  intuition: 


COp(ul  v? ) 


(2 


COp(  ul  c  +w? ) 


COp( u+w| C  ) 


example :  Gaussian  case  (F  «  $) 


Using  our  expression  tor  k^(s,tl?)  in  formula  (2.3)  we  get 


COj (u) 


oo 

{ 


n  i  e2  n  , 

sn_Aexp(-%-  5  (c.-u) 

*  i-1  1 


n-1 


(  s  (c.-cr)  l 

i-1 


ds 


(~)  2  (n-3 )  (n-5)  ..(2or(|)2) 


1 

(n-2) (n-4 )  ..(2or<|)2) 


n 


n-1 


(  5  (c.-c)2) 
i-1  1 


n 


n 
2,1 


<T?) 2 (n-3) (n-5)  ..(2or(i)2)  (  2  (c.-u)2) 

n  2  i-1  1 


JL  t  (Hl£) 

s  cn-l '  s  ' 


(2. 


n 


2  1  —  2 

where  s  -  zrrz~TT  5  (c.-c)  and  t  .(  )  is  the  familiar  t-density 
-  n(n-l)iBl  i  n-1 


with  (n-1)  degrees  ot  freedom  in  its  standard  form.  We  see  that  the 
contidence  density  in  this  case  has  a  fixed  shape  across 
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configurations  so  that  we  could  have  chosen  our  class-representing 

elements  in  such  a  way  that  s_  ■  1  and  c  ■  0  in  which  case  the 

c 

confidence  density  would  always  be  t  ,  (  )  in  standard  form. 

n—  l 

Figures  2.5  and  2.6  show  the  functions  co(  )  (see  2.5)  tor  the 
two  configurations  we  used  already  in  Fig.  2.1  through  2.4.  The  red 
function  shows  the  slash  coverage  density  and  the  black  one  the 
Gaussian  coverage  density,  i.e.  a  Student's-t  density  with  19  degrees 
of  freedom.  These  densities  are  calculated  by  using  a  linear  logistic 
fit  to  Co (  ).  It  is  clear  how  outliers  in  a  configuration  greatly 
influence  the  Gaussian  coverage  density.  In  the  first  --  Gaussian 
drawn  case  --  the  differences  are  not  big,  the  slash  density  is  moved 
a  bit  to  the  left,  which  is  reasonable  it  we  look  back  at  the 
configuration. 

Once  we  know  the  conditional  confidence  distributions  tor  a 
given  sampling  situation  F,  we  can  find  a  confidence  interval 
estimator  tor  any  prefixed  confidence  coefficient  100(1  — c( )  %  in  the 
following  way. 

In  each  configuration  <?  we  fix  the  upper  and  lower  bounds 
such  that  the  conditional  coverage  probability  is  equal  to 
100(1-4)%  .  This  implies  that  the  overall  confidence  level  will 
indeed  be  100(1-4)%.  We  do,  however,  still  have  some  freedom  in  the 
choice  of  the  upper  and  lower  bounds.  The  most  natural  choice  treats 
the  left  and  right  tail  of  the  conditional  confidence  distribution  in 
a  balanced  way,  i.e.  we  leave  out  100  ^%  from  each  tail.  This  now 
uniquely  --  except  possibly  in  pathological  cases  --  determines  a 
confidence  interval  estimate. 
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Figure  2.5  :  Conditional  coverage  densities)  same  configuration  as  in  Fig.  1.1  and 


Gauoe i an-drawn 


Treating  the  two  tails  in  a  balanced  way  means  that  we  care  just 
as  much  about  missing  the  true  location  because  the  lower  bound  is 
too  high  as  missing  because  the  upper  bound  is  too  low.  Once  we  have 
tixed  our  idea  about  the  allowed  conditional  missing  probability,  the 
balanced  division  seems  the  natural  thing  to  do. 

example :  Gaussian  case  (F  *  $  ) 

The  coverage  density  in  this  case  is  always  a  tn-1  -density 

with  location  c  and  scale  s_.  The  above  procedure  is  therefore 

c 

the  usual  symmetric  t-interval. 


3.  Stronq  confidence  intervals  for  a  location  parameter:  A 


compromise  between  the  Gaussian  and  the  slash. 


In  this  section  we  want  to  study  the  effects  of  not  knowing  the 
shape  F  of  the  underlying  sampling  situation.  In  order  to  do  this 
we  will  look  at  the  simplest  possible  case  where  we  restrict 
attention  to  two  possible  candidates,  the  Gaussian  and  the  slash. 

The  latter  is  the  distribution  of  the  ratio  of  a  standard  Gaussian 


and  a  unit  uniform  which  are  independent.  The  density  tor  the  slash 


t(xl*i,<r)  *  - j-2 - [ l-exp(— ]  . 

1  2  2(7 
(2w)^(x-n) 

From  the  verbal  description  we  recognize  the  slash  as  a  "continuous" 

mixture  of  Gaussians  with  scales  which  are  like  an  inverse  uniform. 

The  density  shows  us  the  tail  behavior  as  ~  and  is  therefore  like 

x* 

a  t-density  with  one  degree  of  freedom  (  see  Rogers  and  Tukey  (1972)) 
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tor  further  insights) . 

Nonparametr ic  interval  procedures  --  like  the  sign  interval,  the 
Wilcoxon  signed  rank  interval  and  so  on  --  are  an  attractive  choice 
ot  compromise  tor  many  statisticians.  These  intervals  are  guaranteed 
to  reach  the  desired  confidence  tor  all  symmetric  situations  and 
hence  seem  to  solve  the  problem  ot  compromising  between  situations 
once  and  tor  all.  But  we  would  expect  that  seeking  to  put  such  a 
vast  class  ot  situations  under  one  hat  has  its  disadvantages. 
Furthermore  it  is  not  at  all  clear  how  these  procedures  behave 
conditionally  (on  configurations).  All  nonparametr ic  tests  —  from 
which  the  corresponding  intervals  are  derived  —  need  an  argument  ot 
equal  probability  under  permutations.  They  condition  on  the  class  ot 
samples  which  one  gets  from  the  one  at  hand  under  permuting  around 
the  hypothesized  parameter  value.  A  bit  ot  thought  shows  us  that  this 
is  an  operation  which  does  not  preserve  the  configuration.  It  should 
therefore  be  revealing  to  learn  more  about  the  properties  ot 
nonparametr ic  intervals  conditioned  on  a  given  configuration. 

It  has  been  pointed  out  in  the  "robustness  literature"  that  the 
stability  ot  the  contidence  level  ot  nonparametr ic  procedures  is  only 
one  aspect  which  the  statistician  tries  to  keep  under  control  —  this 
property  has  been  named  "robustness  ot  validity".  Another  aspect  ot 
interest  is  the  etticiency  —  which  can  be  expressed  in  various  ways 
—  ot  a  statistical  procedure. 

3.1.  Strong  contidence  intervals 


K 

S 

Ik 


Even  it  robustness  ot  validity  is  our  goal,  we  need  not  stop  at 
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nonpar ametr ic  intervals 


We  might  reasonably  ask  tor  a  confidence  interval  which  has  a 
"robust"  confidence  coefficient  (or  coverage  probability) 
conditioned  on  configurations.  In  our  "two-situation  world"  this 
would  mean  that  the  intervals  have  to  be  big  enough  to  reach 
100(1 — c< ) %  coverage  probability  conditioned  on  each  configuration 
tor  the  slash  and  tor  the  Gaussian.  Of  course  there  will 
usually  not  be  such  an  interval  and  we  have  to  settle  tor  at 
least  100(l-c()%  coverage  probability  conditioned  on  each 
configuration  tor  the  slash  and  the  Gaussian. 

We  already  know  that  over  the  sample  space  the  slash  coverage  of 
Student's  t  interval  is  conservative.  The  above  approach  might  then 
not  be  very  tar  from  the  classical  t-interval.  We  can  also  say  that 
the  solution  to  this  problem  will  end  up  enlarging  the  t-intervals  in 
certain  configurations. 

It  we  view  robustness  as  a  problem  in  stability  of  coverage 
probabilities,  i.e.  a  problem  in  how  sate  is  the  use  of  a  confidence 
interval,  we  are  lead  to  search  tor  rather  long  intervals  and  our 
flexibility  in  choosing  upper  and  lower  bounds  is  quite  restricted  as 
Figure  3.1  tries  to  show. 
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following  strategy.  Our  starting  point  tor  each  contigur ation 
consists  ot  the  two  intervals  which  have  100(1-4)%  coverage 
probability  and  are  treating  the  upper  and  lower  tail  ot  the  coverage 
density  symmetrically.  Figure  3.2  shows  the  conditional  situation. 
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Figure  3.2:  Conditional  coverage  densities  tor  the  slash  and  Gaussian 
situations  and  the  two  symmetric  intervals 


In  order  to  tind  a  "strong*  interval,  i.e.  one  which  has  at 
least  100(l-d)%  conditional  coverage  probability  in  both  situations 
we  will  look  at  the  interval  which  we  get  by  selecting  the  maximum  ot 
the  two  upper  bounds  and  the  minimum  ot  the  two  lower  bounds. 

L  ■  m  i  n  ( Lg  ,  Lg ) 


U  ■  max(Ug,Ug)  (3.1) 

(see  Figure  3.2) 

Clearly  this  will  be  a  strong  interval  which  can  be  tound  relatively 
easily.  There  are  tour  possible  cases  into  which  configurations  can 

tall: 


(a) 

IL,U]  - 

'VV 

<t>) 

[L,U]  ■ 

!L,.US1 

(c) 

U,U]  - 

tL,’Us! 
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[L,U]  -  [Ls,Ug] 


(3.2) 


The  cases  (a)  and  (b)  are  such  that  one  o t  the  two  situations 
dominates  the  other,  and  the  choice  ot  [L,U]  is  more  or  less  the  best 
thing  one  can  do.  In  the  mixed  cases  (c)  and  (d)  however  the  interval 
[L,U]  could  be  shortened  further  and  still  be  kept  strong.  The 
possible  gains  from  such  a  shortening  however  are  small  compared  to 
the  difficulty  in  finding  them. 


We  will  therefore  examine  the  intervals  given  by  (3.1)  for  the 

case  ot  95%  coverage  probability.  Table  3.1  shows  the  percentage  ot 

observed  cases  (a),  (b)  and  (c)&(d)  in  the  different  sample  sizes. 

Table  3.^:  Percentage  ot  observed  cases  (a),  (b)  and  (c)&(d)  (tor  de¬ 
finition  see  (3.2)) 


sample  size 


Gaussian  situation  II 

II 

a)  (b)  (c)  4(d)  II  (a) 


32  |%  16  |% 


I  89  i%  3  4% 


50  |% 


7 1* 


slash  situation 

(b)  (c)t(d) 


70%  II  80% 


II  78% 

II 

-H - 

II  94 


20  |% 


4  i% 


(In  samples  ot  sizes  10  and  20  the  numbers  are  based  on  150  slash- 
drawn  and  150  Gaussian-drawn  configurations,  in  samples  ot  size  5  the 
corresponding  numbers  are  500  and  500!) 

The  message  from  this  table  is  striking.  The  Gaussian  situation 
is  dominating  in  samples  ot  size  5,  where  we  are  most  ot  the  time  in 
a  case  as  shown  in  Figure  3.2  and  where  Student's  t  interval  is  close 
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to  the  "best"  (in  terms  oi  expected  length)  strong  interval 
procedure.  In  samples  ot  size  5  then,  we  do  not  expect  that 
compromising  between  Gaussian  and  heavy-tailed  slash  is  very 
ditticult.  The  slash  intervals  are  overly  optimistic  and  too  short  it 
judged  trom  the  Gaussian  point  ot  view.  The  Gaussian  intervals,  i.e. 
Student's  t  intervals,  on  the  other  hand  seem  to  do  a  good  job  ot 
keeping  the  conditional  coverage  probability  above  95%  even  in  the 
slash  situation.  And  this  is  true  whether  the  samples  are  drawn  trom 
a  Gaussian  or  trom  a  heavy-tailed  shape.  One  might  conclude  that 
maybe  more  serious  challenges  to  the  Gaussian  model  tor  location 
contidence  intervals  have  to  come  trom  less  heavy-tailed  shapes  than 
the  slash. 

Samples  ot  size  10  and  20  behave  roughly  similarly,  but  very 
ditterently  trom  samples  ot  size  5.  In  slash  drawn  samples  the 
Gaussian  situation  mostly  dominates,  but  not  as  overwhelmingly  as  in 
samples  ot  size  5.  In  Gaussian  drawn  contigurations  the  mixed  cases 
are  a  majority,  introducing  the  slash  along  with  the  Gaussian  and 
thus  really  contributes  a  new  point  ot  view.  In  a  lot  ot 
contigurations  it  torces  us  to  acknowledge  the  tact  that  Student's  t 
interval  doesn't  stretch  tar  enough  to  the  right  or  to  the  lett  and 
has  to  be  enlarged.  A  more  detailed  account  ot  the  behavior  ot  the 
"strong"  intervals  trom  a  contigural  point  ot  view  gives  the 
tollowing  results. 

For  samples  ot  size  20,  Figure  3.3  shows  the  conditional 
coverage  probabilities  tor  the  Gaussian  and  slash  situation.  Figure 
3.4  is  the  corresponding  picture  tor  sample  size  10.  We  see  how  the 
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Figure  3.3:  Stem-and-leaf  plots  of  the  cond.  confidence  levels  of 
the  strong  interval  (3.1)  for  150  sampled  configurations  of  size  20 

N  «  150  Median  ■  0.96295  Hinges  ■  0.955*  0.9676 

Oecieal  point  is  3  placed)  to  the  left  of  the  colon 
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Figure  3.4:  Stem-end- leaf  plots  of  the  cond.  confidence  levels  of 


the  strong 

interval  (3.1)  for  150  sampled  configurations  of  size  10 
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smaller  sample  size  moves  us  nearer  to  Student's  t  interval,  which 
would  have  exactly  constant  95t  conditional  coverage  probability  in 
the  Gaussian  situation.  In  the  slash  situation  this  shows  in  a  trend 
to  be  increasingly  "overlong"  in  samples  ot  size  20. 

The  tollowing  table  gives  the  expected  length  ot  these 
intervals . 

Table  3.2i  Expected  length  ot  "strong"  intervals 


1 

1 

Gaussian 

situation 

slash  situation 

size»5 

T 

1 

i 

2.335 

( .0003) 

103.0  (75.6) 

si ze*10 

1 

i 

1.458 

(.0058) 

22.12  (7.54) 

si ze*20 

1 

1 

1.021 

(.0053) 

63.93  (52.2) 

In  comparison  to  Table  3.1  we  can  again  see  what  we  noticed  by 
looking  at  conditional  coverage  probabilities.  As  the  sample  size 
increases  one  has  to  enlarge  Student's  t  interval  in  order  to  have  at 
least  95%  slash  conditional  coverage. 

The  next  three  pictures,  Figures  3.5,  3.6  and  3.7,  show  us 
something  about  the  conditional  unbalance  or  asymmetry  ot  the  strong 
intervals.  Here  we  plot  the  conditional  probability  ot  missing  the 
true  parametervalue  at  the  low  (i.e.  lett)  end  vs.  at  the  high  end. 

Two  things  are  quite  noticable.  In  all  three  cases  most  ot  the 
strong  intervals  are  slightly  asymmetric.  There  is  a  branch  trom 
(0.0,  0.0)  to  (0.025,  0.025)  above  and  below  the  diagonal  — 
which  contains  the  balanced  intervals.  The  correction  trom 
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slash 


3.7:  Plot  of  the  lower  vs.  the  upper  conditional  missing  probabilities  of  the 
intervals  (4.1)  in  the  slash  situation 

strong  intervals  in  samples  of  size  20 
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Student's  t  to  the  strong  interval  is  theretore  usually  one 


sided.  Furthermore  we  notice  how  the  increase  o i  sample  size 


influences  the  picture.  More  and  more  points  get  absorbed  into 


(0.0,  0.0)  --  which  means  they  are  overlong  --  and  the  two 


branches  we  discussed  are  pushed  toward  the  edges. 


The  above  discussion  has  an  interesting  consequence:  maybe  it  makes 


more  sense  to  compromise  with  a  less  extremely  heavy-tailed 


counterpart  than  the  slash  in  samples  of  size  5.  This  finding 


disagrees  with  the  tact  that  with  bigger  sample  sizes  the  distinction 


between  samples  drawn  from  the  slash  and  samples  drawn  from  the 


Gaussian  becomes  more  "obvious"  —  and  that  compromising  these  two 


situations  is  theretore  simpler  in  larger  samples  (see:  Bell  & 


Morgenthaler (1931) ) .  To  try  this  idea  the  "slacu"-distr ibution  was 


used  together  with  the  Gaussian.  This  is  the  distribution  of  the 


ratio  of  a  standard  Gaussian  and  the  cube  root  of  a  unit  uniform  -- 


it's  density  has  tails  like  -^j.  But  still  Student's  t  interval  comes 


out  to  be  very  close  to  the  strong  interval  --  now  strong  tor 


Gaussian  and  slacu!  Table  3.3  shows  the  numbers. 


Table  3.3:  Percentage  of  observed  cases  (a),  (b)  and  (c)&(d)  (tor  de- 
tini tion~  see  (3.2)  note,  however,  that  the  slash  is  replacM  by  th< 
slacu) 


I  Gaussian  situation 


slash  situation 


sample  size  I  (a)  (b)  ( c )  &  ( d )  ||  (a) 


(b)  (c)  & (d) 


I  78  |% 


20  |% 


91  |%  1  5% 


7  !* 


Surprisingly  little  changed  by  replacing  the  slash  with  the  slacu.  It 
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we  want  to  get  robustness  in  the  sense  ot  stable  conditional  coverage 
probability  under  heavy-tailed  challenges.  Student's  t  interval 
behaves  well  in  samples  ot  size  5  and  leaves  little  scope  tor 
moditications. 

3.2.  Conditional  behavior  ot  nonparametric  cont idence  intervals 


As  we  pointed  out  in  the  introduction  to  this  chapter, 
nonparametric  confidence  intervals,  which  are  assured  to  have  a  fixed 
confidence  level  "over  the  sample  space"  tor  all  symmetric 
situations,  need  not  have  a  fixed  coverage  probability  conditioned  on 
the  configuration.  In  this  section  we  will  see  in  a  descriptive 
fashion  how  the  intervals  derived  from  Wilcoxon's  signed  rank 
statistic  and  from  the  sign  test  statistic  behave  conditioned  on 
configurations. 

We  are  interested  —  as  always  —  in  95%  confidence  intervals 

and  ot  course  neither  ot  the  above  procedures  will  be  able  to  create 

a  95%  confidence  interval  tor  samples  ot  size  5,  where  the  interval 

2 

defined  by  the  minimum  and  maximum  ot  the  sample  has  1  -  yj  *  .9375 
coverage  probability.  In  order  to  compare  over  the  full  range  ot 
sample  sizes,  we  will  include  the  "range"  -  interval  tor  sample  size 
5.  For  samples  ot  size  10  and  20  we  use  logistic  interpolation  in  the 
Wilcoxon  and  binomial  tables  to  get  approximately  a  95%  confidence 
interval.  On  the  configuration  scale,  i.e.  expressed  as  intervals  tor 
c^<Cj< . .  *<cn,  we  use  the  following  intervals: 
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sign 


si ze*5 

size«10 

size=20 


[c 

[• 5c2+. 5c 

[.82Cg+.18c7 


interval 

l'c5! 

•5Cg-f.5Cg] 

, .82c15+. 18c 


Wi lcoxon- interval 


[ . 26Wg+. 74Wg , . 74w^7+. 26w^g] 

[. 34w&4+.66w53, .34w157+.66w15g] 


yi  +  yi 

(w^<. . ,<wn (n+i )  denote  the  Walsh  averages,  i.e.  - ^ ^  ordered  by 

,  %  2 
value .) 

It  turns  out  that  the  sign  interval  and  the  Wilcoxon  interval 
have  quite  ditterent  behaviors  as  one  looks  across  situations  and 
sample  sizes.  The  following  table  contains  the  estimated  variation  ot 
the  conditional  coverage  probabilities. 


Table  3.4:  Hinge-spreads  (see  Tukey(1977))  tor  conditional  coverage 
probability  in  % 


Gaussian  I  slash 
I 

size«20  3.50%  I  3.73% 
sign  intervals  size*10  3.66%*  I  2.58%* 

si ze*5  2.44%  I  5.35% 


size*20  1.25%  I  3.32% 

Wilcoxon  intervals  size*10  1.30%*  I  4.00%* 

size«5  2.44%  I  5.35% 


(*:  Entries  tor  sample  size  5  are  tor  the  " range"- interval ) 

The  sign  interval  procedure  is  getting  worse  in  the  slash 
situation  from  samples  ot  size  10  to  20  as  tar  as  stability  ot  the 
conditional  coverage  level  is  concerned.  The  Wilcoxon  interval  seems 
to  improve.  On  the  whole  it  is  surprising  how  little  the  increase  in 
sample  size  stabilizes  the  conditional  coverage  levels. 
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Both  interval  procedures  are  bad  tor  very  small  samples  in  the 
heavy-tailed  slash  situation.  Looking  across  situations,  the  two 
confidence  intervals  are  complementary  as  well.  The  sign  interval  is 
more  stable  in  the  slash  tor  the  intermediate  sample  size  —  whereas 
the  Wilcoxon  interval  is  better  in  the  Gaussian.  It  we  adopt  a 
measure  ot  variation  which  uses  more  ot  the  information  in  the  tails 
ot  the  distribution  --  like  the  standard  deviation  --  the  ettects 
come  out  even  more  clearly.  They  can  also  be  seen  in  Figures  3.8  and 
3.9,  which  show  box  plots  (see:  Tukey(1977))  ot  the  conditional 
coverage  probabilities. 

In  these  Figures  we  use  the  logistic  transforms  ot  the  conditional 
coverage  probabilities  p,  defined  by 

log  it  (p)  -  1°g(TT^)--^)  (3.3) 

so  that,  on  the  transformed  scale,  a  value  ot  zero  corresponds  to 
exactly  95%  conditional  coverage,  whereas  positive  values  indicate 
conditional  coverage  bigger  than  95%  and  negative  values  indicate 
coverage  smaller  than  95%.  We  hope  that  the  distributions  will  be 
made  more  symmetric  by  this  re-expression. 

In  these  pictures  we  can  also  see  how  the  numbers  ot  Table  3.4 
came  about.  The  Wilcoxon  interval  in  samples  ot  size  10  produces 
quite  low  conditional  slash  coverage  probabilities  in  some  "extreme" 
configurations.  This  is  clear  from  the  tact,  that  information  from 
the  smallest  and  largest  observation  are  sometimes  used,  since  they 
can  contribute  up  to  the  10th  and  46th  Walsh  average.  The  median  ot 
the  conditional  coverage  probabilities  can  be  quite  substantially 
bigger  than  95%  (the  mean  over  configurations) .  This  reflects  the 
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Figure  3-8: 

Logistic  transforms  for  150  sampled  configurations  of  the  sign  interval 


cond.  coverage  for  sign  in  samples  of  size  10  &  2C 


20  in  Couttian  10  in  Gou»»ion  20  in  10  in  »lo»h 


logistic  traneformt 


Figure  3.9: 

Logistic  transforms  for  150  sampled  configurations  of  the  Wilcoxon  interva 


cond.  coverage  for  Wilcoxons  in  samples  of  size  10  &  2 


20  in  CouMion  10  in  CouMion  20  in  •  lo»h  10  in  •  1 c 


tact  that  the  distribution  o t  the  conditional  coverage  probabil i ti es 
is  skewed  towards  low  values. 

Asymptotically,  these  ditterences  between  contigurations  we  have 
observed  here,  disappear.  It  we  repeatedly  take  large  samples  trom  a 
tixed  shape  F(  ),  the  contigurations  we  tall  into  will  usually  be 
relatively  similar  and  the  conditional  coverage  probability  tor  this 
situation  F(  )  will  theretore  be  nearly  constant.  All  the  comments  we 
have  made  are  hence  about  phenomena  observed  trequently  only  in  small 
samples. 

Figures  3.10  through  3.13  show  the  conditional  probability  ot 
missing  the  true  location  parameter  at  the  lower  and  upper  tail  tor 
the  two  nonparametr ic  intervals  and  the  two  situations  in  samples  ot 
size  10.  It  becomes  clear  now  that  the  two  situations  are  indeed 
quite  ditterent.  Both  procedures  create  rather  unbalanced  intervals 
in  the  slash  situation  —  the  Wilcoxon  more  so  than  the  sign.  Many  ot 
the  Wilcoxon  intervals  are  too  long  in  one  direction  and  otten  too 
short  in  the  other.  The  lines  drawn  in  the  pictures  correspond  to 
intervals  with  exact  95%  conditional  coverage  probability.  Most  ot 
the  intervals  are  below  this  line  —  and  are  theretore  overlong.  The 
tewer  points  above  the  line  still  bring  the  overall  coverage  to  the 
nominal  95%.  In  Gaussian  samples  the  Wilcoxon  clearly  does  quite  a 
good  job  and  is  substantially  better  than  the  sign  intervals,  which 
show  a  similar  behavior  as  in  the  slash  situation.  These  plots  look 
similar  in  samples  ot  size  20.  In  that  case  they  are,  however,  a  bit 
more  concentrated  tor  both  situations. 

We  can  summarize  what  we  have  learned  up  to  now  by  saying  that, 
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Figure  3.12:  Plot  of  the  lower  vs.  the  upper  conditional  missing  probabilities  of  the 
Wilcoxon  interval  in  the  slash  situation 


Figure  3.13:  Plot 
Wilcoxon  interval 


Gaussian 


even  though  both  the  sign  and  the  Wilcoxon  intervals  reach  a  tixed 
contidence  level  across  the  whole  sample  space  in  all  symmetric 
situations,  there  are  ditterences  as  soon  as  we  look  at  the 
conditional  contidence  levels.  The  sign  intervals  are  more  stable 
than  the  Wilcoxon  intervals  it  we  sample  trom  the  heavy-tailed  slash. 
The  opposite  is  true  it  we  believe  strictly  in  the  Gaussian  model.  As 
a  compromise  between  these  two  extremes,  we  would  probably  choose  the 
Wilcoxon. 

The  second  important  thing  about  contidence  intervals  besides 
coverage  probability  is  the  length  distribution.  Table  3.5  gives  the 
estimated  mean  lengths  and  coetticients  ot  variation. 

Table  3.5:  Estimated  mean  length  and  estimated  coetticient  ot  varia- 
tion 


1 

Gaussian  situation  || 

i  a 

slash 

situation 

sign  intervals 

size«20 
si ze»10 

1 

1 

1 

1 

1.16 

1.64 

26.5%  || 

29.0%  || 

II 

2.65 

5.39 

23.3% 

75.2% 

Wilcoxon  intervals 

si ze*20 

T 

1 

.955 

"II 

17.5%  II 

3.34 

57.1% 

si ze*10 

1 

1.46 

25.0%  II 

24.51 

476% 

It  is  quite  obvious 

that  the 

Wilcoxon 

intervals  are 

extremely  long 

extreme  contigurations  ot  size  10.  Otherwise  this  table  contirms  our 
view  that  the  sign  intervals  are  to  be  tavored  in  the  slash  and  the 
Wilcoxon  in  the  Gaussian.  This  tact  also  appears  in  an  asymptotic 
theory  via  Pitman  etticiencies.  The  square  root  ot  these  etticiencies 
applies  to  asymptotic  ratios  ot  mean  lengths.  The  values  tor  the 
Gaussian  and  slash  are  summarized  in  Table  3.6. 
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Table  3.6:  Square  roots  ot  ratios  ot  Pitman  etticiencies 


methods  I 

1 

Gaussian 

II 

II 

slash 

r 

sign  vs.  Wilcoxon  1 

.8165 

IT 

II 

1.082 

sign  vs.  Student's  t  1 

.7975 

II 

oo 

Wilcoxon  vs.  Student's  t  1 

.9772 

II 

oo 

We  can  check  how  well  these  asymptotic  numbers  work  by  comparing  them 
to  the  small  sample  estimated  ratios,  as  we  do  in  Table  3.7. 


Table  3.7:  Estimated  ratios  ot  mean  length 


II 

1  1 

Gaussian 

|  slash 

1 

size*  10  |( 

.8901 

1  4.55 

sign  vs.  Wilcoxon 

size*20  II 

.8233 

|  1.2604 

asympt  II 

.8165 

1  1.032 

II 

size*10  II 

.8514 

! 

1  13.47* 

sign  vs.  Student's  t 

size*20  II 

.7966 

1  53.71 

asympt  II 

.7975 

!  oo 

II 

size«10  II 

.9566 

i  * 

15.14* 

Wilcoxon  vs.  Student's  t 

size*20  H 

.9675 

1  65.24 

1  oo 

asympt  II 

.9772 

*:  theoretical  value  is  oo 

A  comparison  between  the  two  sets  ot  numbers  shows  that  the 
asymptotic  expression  is  a  good  approximation  as  tar  as  samples  ot 
size  20  are  concerned,  but  tor  sample  size  10  the  agreement  is  not 
good  except  in  "Wilcoxon  vs.  Student's  t" . 

Both  nonparametr ic  intervals  we  have  considered  up  to  now  do  not 
leave  us  entirely  happy  and  a  procedure  between  these  two  extremes 
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might  well  be  a  better  compromise.  There  are  at  least  two  ways  in 
which  one  can  bridge  the  sign  and  Wilcoxon.  One  was  suggested  by  J. 

W.  Tukey  and  it  works  by  trimming  observations  irom  the  ordered 
sample  and  computing  Wilcoxon' s  signed  rank  statistic  on  what  is  lett 
over,  thus  omitting  certain  i  and  j  entirely.  The  other  (Policello 
and  Hettmansperger (1976) )  works  by  "winsorizing"  the  scores  in 
Wilcoxon's  signed-rank  statistic.  This  is  equivalent  to  omitting 
Walsh  averages  with  |i-j|>bound.  (It,  tor  example,  we  put  the 
extreme  bound  ot  1  on  the  ranks,  the  sign  statistic,  i.e.  the  number 
o£  positive  observations,  comes  out.) 

Both  ot  these  procedures  can  be  explained  in  terms  ot  the 
triangle  ot  Walsh  averages. 


(1)  trimmed  Wilcoxon  (2)  winsorized  scores 
Both  only  take  the  Walsh  averages  in  the  corresponding  shaded  region 
into  account.  These  methods  are  expected  to  do  a  bit  worse  in  the 
Gaussian  than  the  Wilcoxon  but  to  improve  in  the  slash. 

It  turns  out  that  in  samples  ot  size  10  the  procedure  which 
trims  the  largest  and  smallest  observation  very  nearly  gives  the  same 
contidence  intervals  as  the  procedure  which  puts  a  bound  ot  3  on  the 
ranks.  Both  are  worse  in  the  Gaussian  situation  than  the  Wilcoxon, 
which  is  no  surprise  —  but  they  seem  to  be  rather  close  to  the  sign 
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intervals  and  even  a  bit  worse  in  terms  ot  stability  ot  the 
conditional  coverage  probabilities.  In  the  slash  situation,  they  are 
quite  close  to  the  sign  intervals.  In  terms  ot  expected  length  these 
"robust"  intervals  are  an  improvement  (see  Morgenthaler  (1983)). 

In  samples  ot  size  20  the  two  proposed  moditications  ot  the  Wilcoxon 
test  statistic  produce  nearly  identical  results.  Now  these 
* robusti t ied"  rank  intervals  really  make  an  improvement .  In  the 
Gaussian  situation  they  are  between  the  Wilcoxon  and  the  sign,  in  the 
slash  situation  they  improve  over  both  ot  them.  Instead  ot  having  a 
hinge-spread  ot  contidence  levels  across  contigurations  ot  roughly  3% 
(see  Table  3.4),  these  procedures  go  down  to  about  2.5%.  Figure  3.14 
shows  a  boxplot  similar  to  the  Figures  3.8  and  3.9. 

We  would  theretore  recommend  the  use  ot  these  contidence  intervals 
which  bridge  the  gap  between  the  sign  and  the  Wilcoxon  procedures  tor 
larger  sample  sizes  because  in  a  heavy-tailed  case  they  show  an 
improved  behavior. 

4.  Conclusions. 

Strong  contidence  intervals  are  appealing  trom  the  point  ot  view 
ot  validity  ot  an  interval  estimator.  Their  conditional  contidence 
coetticients  given  any  contiguration  is  kept  above  the  nominal  level 
tor  both  situations.  We  only  took  two  situations  into  consideration, 
but  this  is  already  a  conclusive  case.  It  we  sateguard  against  heavy 
tailedness  in  the  way  described  in  this  report,  we  do  not  have  a 
ditticult  job.  Student's  t  interval  is  already  resistant  as  tar  as 
validity  is  concerned.  But  ot  course  many  statisticians  will 
criticize  Student's  t  interval  as  being  too  long  in  contigurations 
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with  outliers.  The  research  described  shows  us  the  need  tor 
compromising  between  validity  and  some  measure  ot  efficiency.  It  we 
only  consider  validity,  we  automatically  sacrifice  efficiency  in  a 
rather  extreme  way.  We  considered  three  sample  sizes  (u  *  5,  10  and 
20)  and  it  is  rather  striking  how  the  problem  ot  compromising  between 
shapes  changes  with  changing  sample  size.  For  samples  ot  size  5  it 
turns  out  that  Student's  t  intervals  are  nearly  "optimal".  As  the 
sample  size  increases  this  no  longer  holds.  In  order  to  be  strong. 
Student's  t  interval  needs  to  be  enlarged  (usually  in  only  one 
direction) . 

In  the  last  section  ot  this  report  we  examine  non-parametr ic 
confidence  intervals.  It  becomes  clear  that  a  stable  overall 
confidence  coefficient  does  not  ensure  a  good  behavior  ot  the 
conditional  confidence  coefficients. 
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